Document Analysis, Pattern Discovery, Information Extraction, Natural Language Processing